OPEN PROBLEMS IN ALGEBRAIC STATISTICS 
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Abstract. Algcbraic statistics is concerned with the study of probabilistic models 
and techniques for statistical inference using methods from algebra and geometry. This 
article presents a list of open mathematical problems in this emerging field, with main 
emphasis on graphical models with hidden variables, maximum likelihood estimation, 
and multivariate Gaussian distributions. These are notes from a lecture presented at the 
IMA in Minneapolis during the 2006/07 program on Applications of Algebraic Geometry. 
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1. Introduction. This article is based on a lecture given in March 
2007 at the workshop on Statistics, Biology and Dynamics held at the In- 
stitute for Mathematics and its Applications (IMA) in Minneapolis as part 
of the 2006/07 program on Applications of Algebraic Geometry. In four 
sections we present mathematical problems whose solutions would likely 
become important contributions to the emerging interactions between al- 
gebraic geometry and computational statistics. Each of the four sections 
starts out with a "specific problem" which plays the role of representing the 
broader research agenda. The latter is summarized in a "general problem" . 

Algebraic statistics is concerned with the study of probabilistic models 
and techniques for statistical inference using methods from algebra and 
geometry. The term was coined in the book of Pistonc, Riccomagno and 
Wynn [25] and subsequently developed for biological applications in [24] . 
Readers from statistics will enjoy the introduction and review recently given 
by Drton and SuUivant [S], while readers from algebra will find various 
points of entry cited in our discussion and listed among our references. 

2. Graphical Models with Hidden Variables. Our first question 
concerns three-dimensional contingency tables (pijk) whose indices i,j,k 
range over a set of four clements, such as the set {A, C, G, T} of DNA bases. 

Specific Problem: Consider the variety of AxAx A-tables of tensor 
rank at most 4. There are certain known polynomials of degree at most 
nine which vanish on this variety. Do they suffice to cut out the variety? 

This particular open problem appears in [211 Conjecture 3.24], and it here 
serves as a placeholder for the foUowing broader direction of inquiry. 

General Problem: Study the geometry and commutative algebra of 
graphical models with hidden random variables. Construct these varieties 
by gluing familiar secant varieties, and by applying representation theory. 
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We are interested in statistical models for discrete data which can 
be represented by polynomial constraints. As is customary in algebraic 
geometry, we consider varieties over the field of complex numbers, with 
the tacit understanding that statisticians mostly care about points whose 
coordinates are real and non-negative. The model referred to in the Specific 
Problem lives in the 64-dimensional space 03 03 C* of 4x4x4-tables 
iPijk), where i,j,k G {A, C,G,T}. It has the parametric representation 

Pijk = Pki ■ <^kj ■ Qkk + Pa ■ CTcj ■ dck+ ,2 \\ 

PGi ■ <JGj ■ dak + Pli ■ CTTj • ÖTfc. 

Our problem is to compute the homogeneous prime ideal / of all polynomi- 
als which vanish on this model. The desired ideal / lives in the polynomial 
ring Q[paaa,Paac,Paat, ■ ■ • ,Pttg,Pttt] with 64 unknowns. In principle, one 
can compute generators of / by applying Gröbner bases methods to the 
parametrization (|2.ip . However, our problem has 64 probabilities and 48 
parameters, and it is simply too big for the kind of computations which 
were performed in [24l §3.2] using the software package Singular |13j . 

Given that Gröbner basis methods appear to be too slow for any prob- 
lem size which is actually relevant for real data, skeptics may wonder why 
a statistician should bother learning the language of ideals and varieties. 
One possible response to the practitioner's legitimate question "Why (pure) 
mathematics?" is offered by the foUowing quote due to Henri Poincaré: 

"Mathematics is the Art of Giving the Same Name to Different Things". 

Indeed, our prime ideal / gives the same name to the foUowing things: 

• the set of 4x4x4-tables of tensor rank < 4, 

• the mixture of four models for three independent random variables, 

• the naive Bayes model with four classes, 

• the conditional independence model [Xi IL X2 IL \Y], 

• the fourth secant variety of the Segre variety P'^xP'^xP^, 

• the general Markov model for the phylogenetic tree 

• superposition of four pure states in a quantum system [H [14] . 
These different terms have been used in the literature for the geometrie 
object represented by (|2.ip . The concise language of commutative algebra 
and algebraic geometry can be an effective channel of communication for 
the different communities of statisticians, computer scientists, physicists, 
engineers and biologists, all of whom have encountered formulas like (|2.ip . 

The generators of lowest degree in our ideal / have degree five, and 
the known generators of highest degree have degree nine. The analysis of 
Landsberg and Manivel in [2Ü1 Proposition 6.3] on 3x3x4-tables of tensor 
rank four implies the existence of additional ideal generators of degree six 
in I. This analysis had been overlooked by the authors of [24 when they 
formulated their Conjecture 3.24. Readers of [24l Chapter 3] are herewith 
kindly asked to replace "of degree 5 and 9 " by "of degree at most 9 ". 
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In what foUows we present the known minimal generators of degree five 
and nine in our prime ideal /, and we postpone a more detailed discussion 
of the Landsberg-Manivel sextics in [201 Proposition 6.3] to a future study. 

Considcr any 3 x 4 x 4-subtable (pijk) and let A, B, C be the 4x4-slices 
gotten by fixing i. To be precise, the entry of the 4 x 4-matrix A in row 
j and column k equals PAjk, the entry of B in row j and column k equals 
Pcjk, and the entry of C in row j and column k equals pGjk- We can 
check that the foUowing identity of 4x4-matrices holds for all tables in our 
model, provided the matrix B is invertible: 

A-B-^ -C = C B-^ A 

After clearing the denominator det(_B), we can write this identity as 

A ■ &A]{B) ■ C - C ■ &A]{B) ■ A = O, (2.2) 

where adj(i?) = (\.et{B) ■ B~^ is the adjoint matrix of B. The matrix entries 
on the left hand side give 16 quintic polynomials which lie in our prime ideal 
/. Each matrix entry is a polynomial with 180 terms which involve only 
30 of the 64 unknowns. For example, the upper left entry looks like this: 

PaacPccaPcggPcttPgaa — PaacPccaPcgtPctgPgaa 
— PaacPccgPcgaPcttPgaa + PaacPcctPcgaPctgPgaa 
+ (175 terms) - PataPcagPcccPcgaPgat- 

We note that thcre are no non-zero polynomials of degree < 4 in the 
ideal /. This foUows from general results on secant varieties [5] I17j. 

An explicit linear algebra computation reveals that all polynomials 
of degree five in I are gotten from the above construction by relabeling 
and considering all subtables of format 3x4x4, format 4x3x4 and for- 
mat 4x4x3, and by applying the natural action of the group GL{^y^) x 
GL{<C^) X GL(C'*) on 4x4x4-tables. This action leaves the ideal / fixed. 
We identity the representation of this group on the space of quintics in /. 

Proposition 2.1. The space oj quintic polynomials in the prime ideal 
I of V2.1\) kas dimension 1728. As a GL{C^)^ -module, it is isomorphic to 

531l(C4)®^211l(C4)® 52111 (C4) 
© 52111 (C4) ^311 (C4) ® 52111 (C*) 
© 5211l(C4)®5211l(C4)®531l(C4). 

Here S'a(C^) denotes the Schur modules which are the irreducible rep- 
resentations of GL{£,'^). We refer to [lOj for the relevant basics on repre- 
sentation theory of the general linear group, and to [TTl [TH [19] for more 
detailed Information about the specific modules under consideration here. 
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The known invariants of degree nine are also obtained by a similar 
construction. Consider any 3 x 3 x 3-subtable (pijk) and denote the three 
shces of that table hy A, B and C. We now consider the 3 x 3-determinant 

det{A- B-^ -C - C ■ B-^ ■ A). (2.3) 

The denominator of the rational function (|2.3p is det(i?)^ and not det(i?)'^ 
as one might think on first glance. The numerator of (|2.3p is a homogeneous 
polynomial of degree nine with 9216 terms which remains invariant under 
permuting A, B and C'. This homogeneous polynomial of degree nine lies 
in the ideal / and is known as the Strassen invariant. 

Proposition 2.2. The GL^C^)^ -submodule of the degree 9 component 
/g generated hy the Strassen invariant is not contained in the ideal (I^) 
generated hy the quintics in Proposition \2. 1\ This module has vector space 
dimension 8000 and it is isomorphic to the representation 

^333(C4)®5333(C4)®^333(C*). 

The first appearance of the Strassen invariant in algebraic statistics 
was [11] Proposition 22]. A conceptual study of the matrix construction 
AB~^C — CB^^A was undertaken by Landsberg and Manivel in [TB]. 

The Specific Problem at the beginning of this section plays a pivotal 
role also in algebraic phylogenetics [U [2 |3]. Our model (|2.ip is known 
there as the general Markov model on a tree with three leaves branching off 
directly from the root. AUman and Rhodes p| §6] showed that phylogenetic 
invariants which cut out the general Markov model on any larger binary 
rooted tree can be constructed from the generators of our ideal / by a gluing 
process. The invariants of degree five and nine arising from (|2.2p and (|2.3p 
are therefore basic building blocks for phylogenetic invariants on arbitrary 
trees whose nodes are labeled with the four letters A, C, G and T. 

In her lecture at the same IMA conference in March 2007, Elizabeth 
Allman [T] offered an extremely attractive prize for the resolution of the 
Specific Problem. She offered to personally catch and smoke wild salmon 
from the Copper River, located in her "backyard" in Alaska, and ship it to 
anyone who will determine a minimal generating set of the prime ideal /. 

In Propositions 12.11 and 12.21 we emphasized the language of represen- 
tation theory in characterizing the defining equations of graphical statis- 
tical models. This methodology is a main focus in the forthcoming book 
by J.M. Landsberg and Jason Morton, which advocates the idea of using 
Schur modules S'a(C") in the description of such models. Morton's key 
insight is that this naturally generalizes conditional independence, the cur- 
rent language of choice for characterizing graphical models. Conditional 
independence statements can be interpreted as a convenient shorthand for 
large systems of quadratic equations; see [Hl §4.1] or [27l Proposition 8.1]. 
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In the absence of hidden random variables, the quadratic equations ex- 
pressed impUcitly by conditional independence are sufScient to characterize 
graphical models. This is the content of the Hammersley-Clifford Theorem 
(see e.g. [12, Theorem 4.1] or [Ml Theorems 1.30 and 1.33]). However, 
when some of the random variables in a graphical model are hidden then 
the situation becomes much more complicated. We believe that representa- 
tion theory of the general linear group can greatly enhance the conditional 
independence calculus which is so widely used by graphical models experts. 
The representation-theoretic notation was here illustrated for a tiny graph- 
ical model, having three observed random variables and one hidden random 
variable, all four having the same state space {A, C, G, T}. 

3. Maximum Likelihood Estimation. In this section wc discuss 
topics concerning the algebraic approach to maximum likelihood estimation 
[M^ §3.3]. The foUowing open problem was published in [16, Problem 13]. 

Specific Problem: Find a geometrie charaeterization of those projective 
varieties whose maximum likelihood degree (ML degree) is equal to one. 
This question and others raised in [6l[T^ are just the tip of an iceberg: 

General Problem: Study the geometry of maximum likelihood estimation 
for algebraie statistical models. 

Here algebraic statistical models are regarded as projective varieties. 
A model has ML degree one if and only if its maximum likelihood estimator 
is a rational function of the data. Models which have this property tend to 
be very nice. For instance, in the special context of undirected graphical 
models (Markov random fields), the property of having ML degree one is 
equivalent to the statement that the graph is decomposable [T2l Theorem 
4.4]. For toric varieties, our question was featured in [27l Problem 8.23]. 

It is hoped that the ML degree is related to convergence properties of 
numerical algorithms used by statisticians, such as iterative proportional 
scaling or the EM algorithm, but no systematic study in this direction has 
yet been undertaken. In general, we wish to learn how statistical features 
of a model relate to geometrie properties of the corresponding variety. 

Here are the relevant definitions for our problems. We fix the complex 
projective space P" with coordinates (po '■ Pi ■ ■ ■ ■ ■ Pn)- The coordinate pi 
represents the probability of the ith event. The n-dimensional probability 
simplex is identified with the set P"q of points in P" which have non- 
negative real coordinates. The data comes in the form of a non-negative 
integer vector (uq, mi, . . . , u„) G N"^^. Here Ui is the number of times the 
ith event was observed. The corresponding likelihood function is defined as 

r / \ PO PI P2 Pn /O 1 ^ 

L{p„,pi,...,p„) = ■ ■ ■ N,,„.„, ■ (3.1) 

Statistical computations are typically done in afline n-space specified by 
Po+Pi + ' ■ '+Pn = Ij where the denominator of L can be ignored. However, 
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the denominator is needed in order for L to be a well-defined rational 
function on P". The unique critical point of the hkehhood function L is at 
{u{) : ui : ■ ■ ■ : Un), and this point is the global maximum of L over P>q. 
By a critical point we mean any point at which the gradiënt of L vanishes. 

An algebraic statistical model is represented by a subvariety A4 of the 
projective space P". The model itself is the intersection of Ai with the 
probabihty simplex P>q. The ML degree of the variety A4 is the number of 
complex critical points of the restriction of the likelihood function L io Ai. 
Here we disregard singular points oï Ai, we only count critical points that 
are not poles or zeros of L, and mq, wi, . . . , are assumed to be generic. If 
Ai is smooth and the divisor on Ai defined by L has normal crossings then 
there is a geometrie characterization of the ML degree, derived in the paper 
[6] with Catanese, Ho§ten and Khetan. The assumptions of smoothness and 
normal crossing are very restrictive and almost never satisfied for models 
of statistical interest. In general, to understand the ML degree will require 
invoking some resolution of singularities and its algebraic underpinnings. 

We illustrate the computation of the ML degree for the case when Ai 
is a plane curve. Here n = 2 and Ai is the zero set of a homogeneous 
polynomial F{pQ,pi,p2). Using Lagrange multipliers or [16|, Proposition 
2], we derive that the condition for (jjq : pi : ^2) to be a critical point of 
the restriction oi L to A4 is equivalent to the system of two equations 

(-"O Po pa-dF/dpo\ 
ui pi pi ■ dF/dpi = 0. 
U2 P2 P2 ■ dF/ dp2 ) 

For a general polynomial F of degree d, these equations will have d{d + 1) 
Solutions, by Bézout's Theorem. Moreover, all of these solutions satisfy 

Po -Pi • ■ -Pn • (Po +Pi H l-Pn) 7^ O, (3.2) 

and we conclude that the ML degree of a general plane curve of degree d is 
equal to d{d + 1). However, that number can drop considerably for special 
curves. For instance, while the ML degree of a general plane quadric equals 
six, the special quadric {p\ = \p0P2} has ML degree two for A ^ 4, and it 
has ML degree one for A = 4. Thus, returning to the Special Problem, our 
first example of a variety of ML degree one is the plane curve defined by 

- - Ht >:)- 

Biologists know this as the Hardy- Weinherg curve, with the parametrization 
Po = , pi = 2Ö(1 - Ö) , P2 = (1 - öf. (3.4) 
The unique critical point of the likelihood function L on this curve equals 



{{2uQ + uif : 2(2uo+-ui)(ui+2u2) : (mi + 2u2)^). 
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Determinantal varieties arise naturally in statistics. They are the mod- 
els M that are specified by imposing rank conditions on a matrix of un- 
knowns. A first example is the model 113. 4p for two i.i.d. binary random 
variables. For a second example we consider the general 3 x 3-matrix 

/poo Poi P02^ 
P = Pio Pil Pi2 I (3.5) 

\P20 P21 P22 / 

which represents two ternary random variables. The independence model 
for these two random variables is the variety of rank one matrices. This 
model also has ML degree one, i.e., the maximum likelihood estimator is a 
rational function in the data. It is given by the 3 x 3-matrix whose entry 
in row i and column j equals (wio + «ii + Ui2) ■ {uoj + uij + U2j)- 

By contrast, consider the mixture model based on two ternary random 
variables. It consists of all matrices P of rank at most two. Thus this 
model is the hypersurface defined by the cubic polynomial F — det(P). 
Explicit computation shows that the ML degree of this hypersurface is ten. 
In general, it remains an open problem to find a formula, in terms of m, n 
and r, for the ML degree of the variety of m x n-matrices of rank < r. 

The first interesting case arises when m ~ n = A and r = 2. At present 
we are unable to solve the likelihood equations for this case symbolically. 
The following concrete biology example was proposed in [211 Example 1.16]: 

"Our data are two aligned DNA sequences ... 

ATCACCAAACATTGGGATGCCTGTGCATTTGCAAGCGGCT 
ATGAGTCTTAAACGCTGGCCATGTGCCATCTTAGACAGCG 

.. test the hypothesis that these two sequences were generated by DiaNA 
using one biased coin and Jour tetrahedral dice...." 

Here the model 7W consists of all (positive) 4x4- matrices (pij) of rank 
at most two. In the given alignment, each match occurs four times and each 
mismatch occurs two times. Hence the likelihood function p.ip equals 

i i^i ij 

Based on experiments with the EM algorithm, we conjectured that the 
f3 3 2 2\ 
3 3 2 2 
2 2 3 3 
V2 2 3 3/ 

function L. In the Nachdiplomsverlesung (postgraduate course) which I 
held at ETH Zürich in the summer of 2005, I offered a cash prize of 100 
Swiss Francs for the resolution of this very specific conjecture, and this 
prize remains unclaimed and is still available at this time (August 2007). 
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matrix (py) = ^ 



is a global maximum of the likelihood 



The state of the art on this 100 Swiss Francs Conjecture is the work of 
Hersh which originated in March 2007 at the IMA. She proved a range of 
constraints on the maximum hkehhood estimates of determinantal models, 
especially when the data Uij have symmetry. A discussion of these ideas 
appears in Hersh's paper with Fienberg, Rinaldo and Zhou [9j . That paper 
gives an exposition of MLE for determinantal models aimed at statisticians. 

4. Gaussian Conditional Independence Models. The early lit- 
erature on algebraic statistics, including the book 24J, dealt primarily with 
discrete random variables (binary, ternary,. . .). The set-up was as described 
in the previous two sections. We now shift gears and consider multivariate 
Gaussian distributions. For continuous random variables, we must work in 
the space of model parameters in order to apply algebraic geometry. The 
foUowing concrete problem concerns Gaussian distributions on R'^. 

Specific Problem: Which sets of almost-principal minors can be zero for 
a positive definite symmetrie 5x5-matrix? 

The general question behind this asks for characterization of all con- 
ditional independence models which can be realized by Gaussians on R". 

General Problem: Study the geometry of conditional independence mod- 
els for multivariate Gaussian random variables. 

The state of the art on these problems appears in the work of Frantisek 
Matüs and hls collaborators. In particular, Matüs' recent paper with 
Lnënicka [20j on representation of gaussoids solves our Specific Problem 
for symmetrie 4x4-matrices. Sullivant's construction in [28] complements 
that work. For more information see also the article by Simecek [26] . 

Let us begin, however, with some basic definitions. Our aim is to 
discuss these problems in a self-contained manner. A multivariate Gaussian 
distribution on M" with mean zero is specified by its covariance matrix 
S = ((Tij). The nxn- matrix E is symmetrie and it is positive definite, 
which means that all its 2" principal minors are positive real numbers. 

An almost-principal minor of S is a subdeterminant which has row 
indices {«} U K and column indices {j} U K for some K C. {1, . . . , n} and 
ï,j S {1, . . . , We denote this subdeterminant by [i_LLj|i^]. For 

example, if n = 5, i — 2,j — 4 and K ~ {I75} then the corresponding 
almost-principal minor of the symmetrie 5x5-niatrix E equals 



Our notation for almost-principal minors is justified by their intimate 
connection to conditional independence, expressed in the following lemma. 
We note that the almost-principal minors are referred to as partial covari- 
ance (or, if rcnormalized, partial correlations) in the statistics literature. 
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Lemma 4.1. The subdeterminant [i IL j \K] is zero for a positive 
definite symmetrie nxn-matrix S if and only if, for the Gaussian random 
variable X on R" with covariance matrix E, the random variatie Xi is 
independent of the random variable Xj given the joint variable Xk ■ 

Proof See 7, Equation(5)], [22l Section 1], or [28l Proposition 2.1]. □ 

Let PD„ denote the ("^^)-dimensional cone of positive definite sym- 
metrie nxn-matrices. Note that this cone is open. A Gaussian conditional 
independence model, or GGI model for short, is any semi-algebraic subset 
of the cone PD,i which can be defined by polynomial equations of the form 

[iAL3\K] = 0. (4.1) 

In algebraic geometry, we simphfy matters by studying the complex alge- 
braic varieties defined by equations of the form (|4.ip . Of course, what we 
are particularly interested in is the real locus of such a complexified GCI 
model, and how it intersects the positive definite cone PD„ and its closure. 

As an illustration of algebraic reasoning for Gaussian conditional in- 
dependence models, we examina an example taken from |28) . Let n = 5 
and consider the GCI model given by the five quadratic polynomials 



[IIL 2 
[21L 3 
[3_U_ 4 
[4_U_ 5 
fSJL 1 



{3}] = cri2Cr33 — cri3tT23 

{4}] = 0-23 0-44 — (724 0'34 

{5}] = cr34CT55 — Cr35f745 

{!}] = cr450'ii — ctmCis 

{2}] — cri5a'22 — Cr25Cri2 



This variety is a complete intersection (it has dimension ten) in the 15- 
dimensional space of symmetrie 5x5-matrices. Primary decomposition re- 
veals that it is the union of precisely two irreducible components, namely, 

• the linear space { (T12 = (T23 = cr34 = (T45 = ctis = O }, and 

• the toric variety defined by the five quadrics plus the extra equation 

0'll(T22(T33Cr44(T55 = Cri30-i4(T240'250'35 . (4.2) 

AU matrices in the open cone PD5 satisfy the inequalities au > O and 

fllCaS > 0'i3 , cr22a'44 > ^24 : CTSSCTSS > Cr^g , Cr440'ii > (714 7 <^5b<^22 > Cr^S- 

Multiplying the left hand sides and right hand sides respectively, we find 

22222 22222 
f^ll'^22f^33'^44C'55 > Cri3C'l4C'24C^25C^35 • 

This is a contradiction to the equation (|4.2p . and we conclude that the 
intersection of our GCI model with PD5 is contained in the linear space 
{ (7i2 = (723 = C34 — (745 = (7i5 =0}. The vanishing of the off-diagonal 
entry aij means that Xi is independent of Xj, or, in symbols, [iiLj]. Our 
algebraic computation thus implies the foUowing axiom for GCI models. 
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COROLLARY 4.1. Suppose the conditional independence statements 
[1^2|{3}], [2^3|{4}], [3^4|{5}], [4^5|{1}], [5^1|{2}] 
hold for some multivariate Gaussian distribution. Then also the following 
five statements must hold: [1-U_2], [2_U_3], [3_U_4], [4_U_5] and [5_U_1]. 

Let us now return to the question "which almost-principal minors 
can simultaneously vanish for a positive definite symmetrie n x n-matrix?" 
CoroUarv 14.11 gives a necessary condition for n = 5. We next discuss the 
answer to our question for n < 4. For n = 3, the necessary and sufficiënt 
conditions are given (up to relabeling) by the foUowing four axioms: 

(a) [ 1 _U_ 2 ] and [ 1 _U_ 3 I {2} ] implics [ 1 ü 3 ] and [ 1 _U_ 2 | {3} ] , 

(b) [liL 2 I {3}] and [liL 3 I {2}] imphes [ 1 _U_ 2 ] and [ 1 _U_ 3 ] , 

(c) [1_U_2] and [1_U_3] imphes [ 1 IL 2 | {3} ] and [ 1 _U_ 3 | {2} ] , 

(d) [1_U_ 2] and [1_U_ 2 I {3}] imphes [1_U_ 3] or [2_U_ 3] . 

The necessity of these axioms can be checked by simple calculations involv- 
ing almost-principal minors of positive definite symmetrie 3x3-matrices: 

(a) 0-12 = Cri3(T22 - (T120-23 = O implicS (Ti3 = 0-12(733 - Cri3(T23 = O, 

(b) cri2(T33 - (T130-23 = 0-130-22 - 0-12(723 = O implicS O-12 = O-13 = O, 

(c) 0-12 = 0-13 = O implies 0-120-33 - 0-130-23 = 0-130-22 - 0-120-23 = O, 

(d) 0-12 = 0-120-33 - 0-130-23 = O imphes 0-13 = O or 0-23 = 0. 
The sufficiency of these axioms was noted in [351 Example 1] . 

For arbitrary n > 3, a collection of almost-principal minors is called 
a gaussoid if it satisfies the axioms (a)-(d), after relabeling and applying 
Schur complements. For instance, axiom (a) is then written as follows: 
[z_U_ j \ L] and [iALk |{j}UL] implies [iALk \ L] and [i_U_ j \ {k}\JL]. 
This axiom is known as the semigraphoid axiom. See [23] for a discussion. 

A gaussoid is representable if it is the set of vanishing almost-principal 
minors of some matrix in PD„. For n = 3 every gaussoid is representable 
by [22I Example 1]. For n = 4, a complete classification of the representable 
gaussoids was given in [2Ü]. We are here asking for the extension to n = 5. 

We now introducé a conceptual framework for our General Problem. 
For each subset S of {1, 2, . . . ,n} we introducé one unknown Hs, and we 
define the submodular cone to be the solution set in of the system of 
linear inequalities 

H{i}uK + H^jyuK < -ff{i,i}uK + Hk, (4.3) 

where K is any subset of {1, ... , n} and i,j G {1, . . . , n}\K. We denote 
this cone by SubMod„ C . Note that SubMod„ is a polyhedral cone 
living in a high-dimensional space while PD„ is a non-polyhedral cone in a 
low-dimensional space. Between these two cones we have the entropy map 

H : PD„ SubMod„, 

which is given by the logarithms of all 2" principal minors of a positive 
definite matrix E = ((7^ ) . Namely, the coordinates of the entropy map are 



iï(E)/ = -logdet(S/), 
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where / is any subset of {l,...,7i} and S/ the corresponding principal 
minor. Note that the entropy map is weU-defined because of the inequality 

det(E{i}u/f ) • det(I]{j}uK) > det(E{,j|uK) • det(SK). (4.4) 

A matrix S e PD„ satisfies (I4.ip if and only if equahty holds in (|4.4p if and 
only if equahty holds in (|4.3p . This imphes the foUowing resuh. 

Proposition 4.1. The Gaussian condüional independence models are 
those subsets of the positive definite cone PD„ that arise as inverse images 
of the faces of the submodular cone SubMod„ under the entropy map H . 

The importance of the submodular cone for probabilistic inference with 
discrete random variables was highlighted in 23]. Here we are concerned 
with Gaussian random variables, and it is the geometry of the entropy map 
which we must study. We can thus paraphrase our problem as foUows. 

General Problem: Characterize the image of the entropy map H and 
how it intersects the various faces of SubMod„ . Study the fibers of this map. 

One approach to this problem is to work with the algebraic equations 
satisfied by the principal minors of a symmetrie matrix. A characteriza- 
tion of these relations in terms of hyperdeterminants was proposed in |15| . 
What we are interested in here is the logarithmic image (or amoeba) of the 
positive part of the hyperdeterminantal variety of [15j . A reasonable first 
approximation to this amoeba is the tropicalization of that variety. More 
precisely, we seek to compute the positive tropical variety [241 §3.4] para- 
metrically represented by the principal minors of a symmetrie Tixrj-matrix. 

5. Bonus Problem on Rational Points. Section 4 dealt with con- 
ditional independence (Cl) models for Gaussians. Our bonus problem con- 
cerns Cl models for discrete random variables, thus returning to the setting 
of Section 2. Consider n discrete random variables ATi, A"2, . . . , Ar„ with 
di,d2, ■ ■ ■ ,dn States. Any collection of Cl statements Xi _LL Xj \Xk speci- 
fies a determinantal variety in the space of tables 

C''' (g) C''^ (g) ■ ■ ■ (g) C^" . (5.1) 

We call such a variety a Cl variety. It is the zero set of a large collection of 
2x2-determinants. These constraints are well-known and listed explicitly 
in [121 §4.1] or [571 Proposition 8.1]. The corresponding strict Cl variety 
is the set of tables for which the given Cl statements hold but all other Cl 
statements do not hold. Thus a strict Cl variety is a constructible subset 
of ()5.1|) which is Zariski open in a Cl variety. The corresponding strict 
Cl model is the intcrsection of the strict Cl variety with the positive or- 
thant . It consists of all positive di x (i2 x ■ • • x d„-tables that lie in a common 
equivalence class, where two tables are equivalent if precisely the same Cl 
statements XiAL Xj \Xk are valid (resp. not valid) for both tables. 

Bonus Problem: Does every strict Cl model have a Q-rational point? 
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This charming problcm was proposed by F. Matüs in [21] page 275]. It 
suggests that algebraic statistics has something to offer also for arithmetic 
geometers. One conceivable solution to the Bonus Problcm miglit say that 
Cl models with no rational points exist but that rational points always 
appear when the number of states grows large, that is, for di , c?2 , ■ • • , ^ 
0. But that is pure speculation. At present we know next to nothing. 

6. Brief Conclusion. This article offered a whirlwind introduction to 
the emerging field of algebraic statistics, by discussing a few of its numerous 
open problems. Aside from the Bonus Problem above, we had listed three 
Specific Problems whose solution might be particularly rewarding: 

• Consider the variety of 4x4x4-tables of tensor rank at most 4. Do 
the known polynomial invariants of degree at most nine suffice to 
define this variety? Set-theoretically? Ideal-theoretically? 

• Characterize all projective varieties whose maximum likelihood de- 
gree is equal to one. 

• Which sets of almost-principal minors can be simultaneously zero 
for a positive definite symmetrie 5x5-matrix? 
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